Learning privately from multiparty data
نویسندگان
چکیده
Learning a classifier from private data collected by multiple parties is an important problem that has many potential applications. How can we build an accurate and differentially private global classifier by combining locally-trained classifiers from different parties, without access to any party’s private data? We propose to transfer the ‘knowledge’ of the local classifier ensemble by first creating labeled data from auxiliary unlabeled data, and then train a global -differentially private classifier. We show that majority voting is too sensitive and therefore propose a new risk weighted by class probabilities estimated from the ensemble. Relative to a non-private solution, our private solution has a generalization error bounded by O( −2M−2) where M is the number of parties. This allows strong privacy without performance loss whenM is large, such as in crowdsensing applications. We demonstrate the performance of our method with realistic tasks of activity recognition, network intrusion detection, and malicious URL detection.
منابع مشابه
Secure multiparty computation of a comparison problem
Private comparison is fundamental to secure multiparty computation. In this study, we propose novel protocols to privately determine [Formula: see text], or [Formula: see text] in one execution. First, a 0-1-vector encoding method is introduced to encode a number into a vector, and the Goldwasser-Micali encryption scheme is used to compare integers privately. Then, we propose a protocol by usin...
متن کاملFast Steganography-based Multi-Party Protocols for Privacy-Preserving Association Rule Mining in Vertically Partitioned Data
Recently, with the emergence of privacy issues in data mining, considerable research has focused on developing new data mining algorithms that incorporate privacy constraints, and, in the same time, are as efficient as possible in terms of accuracy of the results. In this paper, we focus on privately mining association rules in vertically partitioned data, and propose two steganography-based mu...
متن کاملMultiparty Cloud Computation
With the increasing popularity of the cloud, clients oursource their data to clouds in order to take advantage of unlimited virtualized storage space and the low management cost. Such trend prompts the privately oursourcing computation, called multiparty cloud computation (MCC): Given k clients storing their data in the cloud, how can they perform the joint functionality by contributing their p...
متن کاملLiterature Survey on Secure Multiparty Anonymous Data Sharing
The popularity of internet as a communication medium whether for personal or business requires anonymous communication in various ways. Businesses also have legitimate reasons to make communication anonymous and avoid the consequences of identity revelation. The problem of sharing privately held data so that the individuals who are the subjects of the data cannot be identified has been research...
متن کاملPrivate Learning on Networks
Continual data collection and widespread deployment of machine learning algorithms, particularly the distributed variants, have raised new privacy challenges. In a distributed machine learning scenario, the dataset is stored among several machines and they solve a distributed optimization problem to collectively learn the underlying model. We present a secure multiparty computation inspired pri...
متن کامل